Audio classification using dominant spatial patterns in time-frequency space

نویسندگان

  • Md. Khademul Islam Molla
  • Keikichi Hirose
چکیده

This paper presents a novel audio discrimination algorithm using spatial features in time-frequency (TF) space. Three types of audio signals – speech, music without vocal and music with background vocal are taken into consideration for classification. The audio segment is transformed into TF domain yielding the spatial illustration of energy. Nonnegative matrix factorization (NMF) is applied to TF space to extract a set of vectors which represents the dominant subspace of spatial energy distribution. The inverse Fourier transform is applied to individual dominant vectors to derive the features for audio discrimination. The classification is performed by using multiclass linear discriminant analysis (mcLDA). The experimental results show that the proposed algorithm is more noise robust and performs better than the recently reported methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Attenuation of spatial aliasing in CMP domain by non-linear interpolation of seismic data along local slopes

Spatial aliasing is an unwanted side effect that produces artifacts during seismic data processing, imaging and interpolation. It is often caused by insufficient spatial sampling of seismic data and often happens in CMP (Common Mid-Point) gather. To tackle this artifact, several techniques have been developed in time-space domain as well as frequency domain such as frequency-wavenumber, frequen...

متن کامل

Combining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)

Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...

متن کامل

Studying the Monthly Effect on the Market Reactions Using Time-Space -Frequency Analysis (Case Study: Tehran Stock Exchange)

Anomaly is an incident or event that cannot be explained by the dominant theories. Anomalies are situated in confronting with the efficient market theory, so that it provides conditions for stock trading strategies with additional returns in case of existing predetermined returns. Therefore, in this study, the anomaly due to monthly effects on the stock volume trading and the Tehran Stock Excha...

متن کامل

Newborn EEG Seizure Detection Based on Interspike Space Distribution in the Time-Frequency Domain

This paper presents a new time-frequency based EEG seizure detection method. This method uses the distribution of interspike intervals as a criterion for discriminating between seizure and nonseizure activities. To detect spikes in the EEG, the signal is mapped into the time-frequency domain. The high instantaneous energy of spikes is reflected as a localized energy in time-frequency domain. Hi...

متن کامل

Exploring the Patterns of In-Between Spaces in Guilan Historical Houses

The aim of this paper is to explain the spatial patterns of in-between spaces in Guilan historical houses in order to show their potential capacity in having various functions, and thus different forms, in the course of history. In-between spaces are mediators between two other spaces making them accessible or visible for each other. An explanation of their spatial patterns can both reveal the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013